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USING  THEORETICAL  DESCRIPTORS  IN  STRUCTURAL  ACTIVITY  RELATIONSHIPS 
V.  A  REVIEW  OF  THE  THEORETICAL  PARAMETERS 


1.  INTRODUCTION 

The  estimation  and  prediction  of  physical  and  chemical  properties 
of  compounds  and  associated  activities  generated  a  great  deal  of  interest  in 
the  past  several  years.  A  principal  reason  for  this  has  been  the  need  to 
know  various  properties  quickly  and  often  before  the  compound  has  been  synthe¬ 
sized.  There  are  two  excellent  compilations  of  property  estimation  routines 
in  the  1 iterature. 3 

An  effort  is  underway  at  the  U.S.  Army  Chemical  Research,  Develop¬ 
ment  and  Engineering  Center  (CRDEC)  to  develop  the  capability  fcr  a  priori 
prediction  of  physical  and  chemical  properties.3  Although  many  of  the 
routines  in  the  above  compilations  are  useful,  the  unique  nature  of  the 
compounds  of  interest  often  requires  new  methods  to  be  developed  or  existing 
methods  to  be  modified. 

1 . 1  Linear  Free  Energy  Relationships. 

Although  it  has  been  theorized  for  quite  some  time  that  molecular 
structure  directly  impacts  physical  and  chemical  activities,  it  was  not  until 
Hammett  developed  the  concept  of  Linear  Free  Energy  Relationships  (LFER)  that 
the  relationship  was  quantified.4  Using  heuristic  thermodynamic  and  kinetic 
arguments,  Hammett  developed  a  theoretically  supportable  equation  that  related 
a  given  reactivity  to  substituent  effects. 

The  basic  premise  behind  LFER  is  shown  in  equation  1  where  the  equi¬ 
librium  constant  (Keq)  is  related  to  the  Gibbs  Free  Energy  (aG). 

aG  =  -2.30  RT  LOG  Keq  (1) 

Hammett  examined,  in  particular,  the  hydrolysis  of  benzoic  acid 
ester  and  the  effects  of  substitution  on  the  relative  rates  of  reaction. 

From  simple  substitution  of  equation  1,  he  was  able  to  derive  equation  2. 

K 

LOG  —  •=  pa  (2) 

><0 

The  terms  p  and  a  represent  electronic  effects  due  to  the  nature  of 
the  substituents  on  the  benzene  (a)  or  to  the  nature  of  the  reaction  (p). 

1 . 2  Quantitative  Structural  Activity  Relationships. 

Using  the  arguments  of  Hammett,  Hansch  expanded  the  concept  of  the 
LFER  to  increase  the  use  of  this  type  of  equation. $  Two  assumptions  were 
made:  the  activity  being  considered  was  directly  reined  to  the  aG,  and  the 
&G  was  directly  related  to  the  molecular  description. ^  This  relationship  is 
shown  in  equation  3. 


LOG  A  =  Fa-jX} 


(3) 
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where  represents  the  ith  molecular  descriptor/parameter  In  the  derived 
regression,  and  a-j  represents  the  corresponding  coefficients.  A  wide  variety 
of  parameters  have  been  used  in  the  past  to  achieve  the  most  consistent  corre¬ 
lations. 


1 . 3  Linear  Solvation  Energy  Relationships. 

Using  the  procedures  of  Quantitative  Structure  Activity  Relationship 
(QSAR)and  LFER,  Kamlet  and  Taft  developed  a  relationship  correlating  a  special¬ 
ized  set  of  parameters  with  a  large  number  of  solute/solvent  interactions.?*9 
The  relationship,  the  Generalized  Linear  Solvation  Energy  Relationship  (LSER), 
takes  the  form  shown  in  equation  4. 


Log  Property  =  Steric  Term  +  Polarizability  Term 
+  Hydrogen  Bonding  Term(s)  +  C 


(4) 


Kamlet  and  Taft  found  three  principal  factors  affecting  a  given  solute/ sol  vent 
interaction:  a.  steric  effects,  the  size  constraints  of  the  solute  and  the 
ability  of  the  solvent  to  accommodate  the  particular  size  of  the  solute;  b. 
polarizability/dipolarity,  the  ability  to  have  a  dipole  induced  in  a  molecule; 
and  c.  the  ability  to  initiate  or  accept  hydrogen  bonding.  In  following  the 
reasonings  of  QSAR  and  LFER.  a  series  of  parameters  were  developed  that  reflect 
each  of  these  general  factors.  The  parameters,  often  called  the  Kamlet-Taft 
or  sol vatochromic  parameters,  were  correlated  with  a  large  number  of  different 
types  of  solute/solvent  interactions. 9 

The  LSER  equation  with  the  appropriate  sol vatochromic  parameters  is 
shown  in  equation  5.  These  descriptors  are  explained  in  Table  1. 

LOG  Property  =  mVm  ♦  p**  +  a*  +  b-?  +  C  (5) 

1.4  The  Theoretical  LSER. 


The  descriptors  in  Table  1  were  a  tremendous  aid  in  advancing  the 
understanding  of  solute/solvent  interactions  from  a  physical-organic  standpoint. 
One  major  difficulty  in  this  development  is  the  nature  of  the  sol vatochromic 
parameters.  Each  of  these  parameters  are  inherently  empirical  in  nature, 
preventing  the  use  of  these  equations  for  a  prior  Drediction  and  estimation 
of  solute/ sol  vent  properties. 

Theoretically  derived  parameters  are  now  being  used  in  these  types 
of  relationships.  9  The  previous  reports  in  this  series  have  dealt  with 
development  of  the  Theorectical  LSER  (TLSER)  and  the  applications  to  the 
prediction  and  estimation  of  physical  and  chemical  properti es.  11-14 
report  is  an  attempt  to  summarize  the  TLSER  and  to  demonstrate  the  usefulness 
of  this  procedure.  The  generalized  TLSER  is  shown  in  equation  6. 

LOG  Property  =  mVmc  +  pirj  +  a1E:a  +  a2qH+  +  bjeb  +  b2q_  +  C  (6) 

The  parameters  are  shown  in  Table  1  and  fully  described  in  the  Results  section. 
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Table  1.  Symbols  Used 


MW  *  Molecular  Weight  (g/mol) 

dj  *  Liquid  density  (g/ml) 

Vm  *  Molar  volume 

tt*  *  Sol vatochromic  Polarizability  Term 

6  *  Sol vatochromic  Hydrogen  Bond  Basicity  Term 

a  «=  Sol  vatochromic  Hydrogen  Bond  Acidity  Term 

Vmc  *  Molecular  Volume  (A3) 

it i  *  Polarizability  Index 

=  Molecular  Orbital  Basicity  (kcals) 
ca  =  Molecular  Orbital  Acidity  (kcals) 

q_  =  Electrostatic  Basicity 

qH+  a  Electrostatic  Acidity 

Kaw  =  Octanol/Water  Partition  Coefficient 

EA  =  Electronic  Absorption  (UV-Vis) 

LC50  =  50%  Lethal  Concentration  (Konneman) 

EC50  *  50%  Effective  Concentration  (Microtox) 

ky5  =  HPLC  Retention  Capacity  in  75%  Methanol 
kgn  =  Hydrolysis  Rate  Constant 

C  =  Regression  Coefficient 


2.  EXPERIMENTATION 

2.1  Experimental  Data. 

All  experimental  data  were  taken  either  from  the  original  sources 
for  the  data  or  from  the  corresponding  report  by  Kamlet  and  Taft.  To  maintain 
as  much  consistency  as  possible,  where  the  TLSER  and  LSER  are  directly 
compared,  identical  data  sets  have  been  used.  The  a,  R,  **  and  molar  volumes 
were  taken  from  articles  of  Kamlet  and  Taft.8 

2.2  Calculations. 

All  calculations,  regressions,  (results  of  the  regressions  and  data 
appear  in  the  previous  reports  of  this  series).  All  calculations  were 
performed  on  a  yVAXII  running  under  the  VMS  Operating  Systems.  All  quantum 
calculations  and  geometric  optimizations  were  performed  using  the  MNDO  algo¬ 
rithm  within  the  MOPAC. 15*16  j^e  moieCualr  volumes  were  calculated  using  the 
procedure  of  Hopfinger  incorporated  in  the  MMADS  software. I7 ’l8  All  ^regres¬ 
sions  were  performed  uning  the  MINITAB  statistical  software  package. 


3.  RESULTS 

The  development  of  the  TLSER  can  be  envisioned  as  two  separate  goals. 
First,  the  developed  TLSER  parameters  should,  as  much  as  possible,  correlate 

♦MiNITAB  Ts  marketed  by  Mini  tab,  INC.,  State  College,  PA 
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one  for  one  with  the  LSER  parameters.  This  will  permit  the  estimation  of  the 
LSER  parameters.  There  have  been  some  attempts  at  this  although  the  correla¬ 
tions  obtained  are  of  questionable  value.10  This  could  be  due  to  the  level  of 
the  theoretical  calculations  as  well  as  the  parameters  used.  The  second  goal 
is  the  direct  implementation  of  the  theoretical  descriptors  into  the  TLSER 
equation,  bypassing  the  LSER  parameters.  In  fact,  this  goal  is  the  more 
important,  as  the  primary  goal  is  the  development  of  usable  methods  of  a 
priori  prediction  of  physical  and  chemical  properties.  There  have  been  quite 
a  few  correlations  of  theoretical  parameters  in  QSAR  equations  although, 
again,  the  correlations  have  usually  been  less  than  satisfactory  for  predic¬ 
tive  purposes.1’  The  following  sections  describe  the  TLSER  parameters,  compare 
them  to  the  LSER  parameters,  then  combine  all  of  them  into  direct  correlations 
with  the  solute/solvent  properties. 

Table  1  lists  the  abbreviations  used  to  refer  to  the  LSER  and  TLSER 
parameters  throughout  this  report.  All  aquations  are  reported  with  the  data 
set  size,  N,  and  two  descriptions  of  the  degree  of  fit:  the  standard  error 
of  the  estimate,  SEE  (sd);  and  the  correlation  coefficient,  R. 

3.1  The  TLSER  Descriptors. 

3.1.1  Molecular  Volume. 

The  first  term  in  the  LSER  equation  is  a  steric  or  cavity  term. 

This  descriptor  is  intended  as  a  measure  of  the  energy  required  to  create  a 
hole  in  the  solvent  matrix  sufficient  to  solvate  the  solute  molecule.  In 
dealing  with  multiple  solutes  in  a  single  solvent,  the  energy  required  to 
separate  the  solvent  molecules  a  given  distance  remains  constant,  but  the 
distance  required  to  separate  the  molecules  varies  with  the  size  of  the 
solute  molecule.  In  the  LSER,  the  steric  term  used  to  account  for  this 
phenomenon  is  the  Molar  Volume  (Vm,  MW/dj). 

In  the  past,  steric  terms  have  been  the  easiest  to  calculate, 
because  electronic  contributions  are  not  explicitly  considered  in  this  desc¬ 
riptor.  There  have  been  many  steric  terms  used  in  QSAR  equations  with  very 
good  results,  ranging  from  such  empirically  derived  descriptors  as  Molar 
Volume  and  Taft:s  E$  parameter  to  theoretically  derived  volumes  and  surface 
areas. ^  Using  computational  techniques,  it  is  possible  to  calculate  the 
volume  taken  up  by  a  molecule  in  space  by  adding  the  Van  der  Waal's  contribu¬ 
tions  for  each  atom  and  subtracting  out  the  overlap  volumes.  Figure  l  shows 
a  pictorial  representation  of  the  Molecular  Volume  of  a  methyl  chloride 
molecule. 
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In  theory,  then,  the  Vmr  and  the  Vm  should  correlate  well  with  each 
other.  Equation  7  shows  the  relationship  between  these  two  terms  for  80 
diverse  compounds. ^ 

Vm 

—  =  0.891 
100 


Vmc 

—  +  0. 

.100 


152 


(7) 


N  =  80  R  =0.9690  SEE=0.073 


From  equation  7,  it  can  be  seen  that  the  fit  with  the  molar  volume 
is  excellent,  indicating  that  the  Vmc  can  be  used  to  predict  and  to  replace 
the  V,,,. 


3.1.2  Polarizability  Index. 

The  remaining  terms  in  the  LSER,  and  therefore  the  TLSER,  are  elec¬ 
tronic  in  nature.  To  best  approximate  these  methods,  quantum  chemical  methods 
were  employed.  Several  methods  were  compared  for  computational  speed  and  the 
ability  to  generate  reasonable  geometries,  including  Gaussian82,  ZIND0,  MND0, 
and  AMPAC.  The  MND0  method  with  M0PAC  was  chosen  as  giving  the  most  reason¬ 
able  geometry  for  the  data  sets  employed  without  requiring  an  unreasonable 
amount  of  computer  time. 

The  polarizability  U*)  was  the  second  term  of  the  LSER  to  be 
replaced.  In  general,  this  term  describes  the  ability  of  the  electrons  in 
the  valence  shell  of  the  solute  to  be  polarized.  This  is  shown  in  Figure  2, 
where  the  electron  cloud  has  a  roughly  elliptical  shape  in  the  absence  of  any 
field.  As  an  electric  field  is  introduced,  the  electrons  polarize  themselves 
to  cause  a  dipole  moment  to  be  Induced  in  the  molecule.  In  normal  solution 
chemistry,  this  field  is  caused  by  solvents  or  other  compounds  with  high 
dipole  moments. 


Figure  2.  Polarizability 

M0PAC  is  capable  of  calculating  the  Polarization  Volume  (PV)  using 
the  output  of  a  MND0  calculation,^  In  effect,  the  PV  is  the  change  in  the 
Van  der  Waal's  volume  of  the  molecule  due  to  the  introduction  of  an  electrical 
field.  This  term,  with  units  of  A^,  has  a  high  correlation  with  the  Molecular 
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Volume  and,  in  effect,  is  a  combination  of  volume  and  to  generate. a  unitless 
descriptor,  the  Polarizability  Index  (711)  is  calculated  as  shown  in 

equation 


Polarization  Volume 

TTj  =  - 

Molecular  Volume 


(8) 


This  results  is  a  descriptor  which  is  solely  dependent  upon  the 
ability  of  the  electrons  to  move  throughout  the  entire  molecule.  As  such, 
one  would  expect  aromatic  compounds  to  be  among  the  highest  values  with 
double  and  triple  bonds  to  be  below  that.  Compounds  with  highly  electroneg¬ 
ative  atoms  would  be  expected  to  have  lower  nj  values,  and  those  with  third 
and  fourth  row  atoms  greater.  Table  2  shows  five  representative  *,  values, 
showing  this  comparison. 


Table  2.  Representative  Polarizability  Index  Values 


Compound 

*1 

Ethanol 

0.0927 

Propane 

0.0953 

Butyl  Chloride 

0.1036 

Benzene 

0.1204 

Pentachlorobenzene 

0.1331 

3.1.3  Basicity  Terms. 

Two  terms  were  required  to  adequately  correlate  with  the  b  teim  of 
the  LSER.  These  two  terms  are  the  Molecular  Orbital  Basicity  (ep)  and 
the  Electrostatic  Basicity  (q_).  Equation  9  shows  the  correlation  with  b. 


B  =  0.240  -  0.196eb  +  1.699q_ 

N  =  77  R  =  .9518  SEE  =  0.0777 


(9) 


3.1.4  Molecular  Orbital  Basicity. 

The  Molecular  Orbital  Basicity  can  be  envisioned  both  in  theoretical 
and  chemical  terms.  Equation  10  shows  the  relationship  for  this  term  to  the 
Highest  Occupied  Molecular  Orbital  (HOMO)  and  the  Lowest  Occupied  Molecular 
Orbital  (LUMO). 

eb  =  ^M®Water  “  ^^Substrate  (10) 

eLUM0  the  ener9>  of  the  LUMO  of  water,  and  EHOMq  is  the  HOMO  of  the 
substrate  or  solute.  Therefore,  the  ep  is  the  Interaction  energy  for  the 
covalent  transfer  of  electrons  from  the  substrate  to  water.  The  lower  the  ep, 
the  more  facile  the  exchange.  This  term  is  a  covalent  basicity  term  and  is 
shown  in  Figure  3.  Table  3  lists  representative  values  for  the  b,  ep,  and  qH. 
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Acid- Base 


Figure  3.  Molecular  Orbital  Basicity 


Table  3.  Representative  Values  of  Basicity 


Compound 

e  b 

q_ 

3 

Carbon  Tetrachloride 

18.6798 

0.070 

0.10 

Hexane 

17.4761 

0.022 

Butanol 

16.7365 

0.325 

0.45 

Benzene 

14.8338 

0.059 

0.10 

Hexamethyl  Phosphoramide 

15.4321 

0.650 

1.05 

According  to  Ligand  Field  Theory,  when  an  acid-base  interaction  (or 
any  reaction)  occurs,  the  HOMO  of  base  and  LUMO  of  the  acid  mix  to  form  an 
adduct  HOMO  and  an  adduct  LUMO. The  nearer  the  HOMO  and  LUMO  of  the  reac¬ 
tants  are  in  energy,  the  greater  the  degree  of  overlap  results.  The  greater 
degree  of  overlap  lowers  the  adduct  HOMO,  In  general,  the  lower  adduct  HOMO 
signifies  a  more  stable  product  and,  therefore,  a  more  likely  reaction. 

This  can  be  related  to  the  TLSER  parameters  and  to  the  LSER  B. 
Because  Eb  represents  the  difference  between  the  HOMO  of  the  substrate  and 
the  LUMO  of  water,  the  larger  the  difference,  the  lower  the  basicity.  This 
implies  that  an  acid-base  interation  would  be  less  likely  to  occur.  Therefore 
eb  should  have  a  negative  sign  in  the  correlation  for  p  and  an  opposite 
other  than  that  of  B  in  the  property  correlations.  Going  back  to  equation  9, 
it  can  be  seen  that  8  requires  a  negative  sign.  The  validity  of  an  opposite 
other  than  $  in  the  property  correlations  shall  be  shown  in  later  sections. 
Table  3  lists  a  representative  set  of  eb  and  e  values  of  five  compounds. 

3.1.5  Electrostatic  Basicity. 

The  second  basicity  term,  and  the  final  parameter  to  be  discussed  in 
depth  in  this  study,  is  the  Electrostatic  Basicity  (q_).  This  parameter  is 
the  absolute  value  of  the  most  negative  formal  charge  in  the  substrate. 

Where  Eb  represents  the  covalent  interaction  of  acid  base,  q_  represents 
the  charge-charge  attraction.  It  would  be  expected,  then,  as  q__  gets  larger 
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in  absolute  magnitude,  the  attraction  would  be  better,  and  the  substrate 
would  be  a  better  base.  The  s  should  then  vary  proportionally  to  q__.  This 
can  be  seen  in  equation  9  where  the  appropriate  coefficient  has  a  positive 
sign. 


3.1.6  Acidity  Terms. 

In  replacing  the  acidity  term,  a,  in  the  LSER,  an  identical 
algorithm  was  employed  for  the  basicity  terms.  Two  parameters  were  developed, 
the  Molecular  Orbital  Acidity  (ea)  and  the  Electrostatic  Acidity  (qH+). 

A  correlation  has  not  yet  been  made  between  a  with  ea  and  qH+.  Table  4 
lists  corresponding  ea,  qH+,  and  o  values  for  five  representative  compounds. 


Table  4.  Representative  Values  of  Acidity 


Compound 

ea 

qH+ 

a 

Methanol 

15.9831 

0.236 

0.33 

Acetone 

12.850 

0.023 

0.00 

Benzene 

12.5588 

0.059 

0.00 

Phenol 

12.4382 

0.193 

0.61 

Carbon  Tetrachloride 

15.2253 

0.000 

0.00 

3.1.7  Molecular  Orbital  Acidity. 

The  Molecular  Orbital  Acidity,  ea,  Is  analogous  to  eb*  Rather 
than  describing  the  flow  of  electrons  from  the  substrate  to  water,  it  describes 
the  flow  of  electrons  from  water  to  the  substrate.  The  ea  is  merely  the 
^ HOMO  water  subtracted  from  the  EhjmO  the  substrate.  This  is  shown  in 
equation  11. 


ea  3  LUM0$u5Strate  -  H0M0yater  (11) 

As  can  been  seen,  this  is  the  exact  opposite  of  eb»  with  the 
substrate  now  acting  as  the  acid  and  water  acting  as  the  base. 

3.1.8  Electrostatic  Acidity. 

The  Electrostatic  Acidity,  qH+,  represents  the  hydrogen  or  Bronstead 
acidity  in  the  molecule.  This  is  described  by  the  most  positive  formal  charge 
of  any  hydrogen  atom  in  the  substrate  molecule. 

3.2  Examples  of  the  TLSER. 

The  real  use  of  the  TLSER  parameters  is  not  so  much  in  being  able 
to  theoretically  validate  and  predict  the  LSER  parameters  but  rather  to  be 
able  to  be  used  in  relationships  with  actual  solute/solvent  interactions.  To 
provide  a  degree  of  comparison,  corresponding  LSER  correlations  will  also  be 
given  with  the  TLSER  correlations. 
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Several  different  solute/solvent  Interactions  were  examined.  Includ¬ 
ing  toxicological  properties,  physical  properties,  and  spectral  absorptions. 
The  following  sections  describe  each  interaction  in  depth. 

3.2.1  Octanol/Water  Partition  Coefficient. 

The  Octanol/Water  Partition  Coefficient  (Kow)  ^as  *5een  used  exten¬ 
sively  in  medicinal  chemistry  and  In  QSAR  equations.22*23  The  principal 
reason  for  this  has  been  the  degree  to  which  the  K0w  models  the  transport  of 
compounds  over  the  blood/brain  barrier  and  other  1 ipophll ic/hydrophil 1c 
partitionings.  As  such.  Kg*  has  become  the  principal  parameter  in  QSAR 
equations  Involving  biological  activities.^ 


Using  the  TLSER  parameters,  equation  12  was  developed  for  a  data 
set  of  64  compounds  of  diverse  classes.7*1^ 


LOG  Kow 


0.847  ( w* 10 )  +  1.73 


N  =  64  R  =  0.9566  SEE  =  0.3568 


5.415q_  -  3.960 

(12) 


A  similar  relationship  was  developed  by  Kamlet  and  Taft  using  the 
LSER  parameters  shown  in  equation  13.  It  is  of  note  that  Kamlet  and  Taft 
included  only  general  aliphatics  and  alcohols  but  did  not  Included  aromatics 
in  the  equation  below. 


LOG  Kow  =  0.10+2 


,  T  Vml 

.89  —  -  0.88it  -  3.62b 

.100 


N  «  63  R  =  .9890  SEE  =  0.1800 


(13) 


In  comparing  these  two  equations,  several  items  are  evident.  First, 
the  correlation  coefficient  (R)  is  significantly  higher  in  the  LSER  equation 
(13)  than  in  the  TLSER  equation  (12),  and  the  standard  error  of  the  estimate 
(SEE)  is  much  lower.  Two  factors  can  account  for  this  increased  fit.  First, 
the  TLSER  was  developed  for  a  gas-phase  molecule  in  a  vacuum  with  no  inter- 
molecular  interactions  taken  into  account.  This  is  a  major  shortcoming  of 
quantum  chemical  techniques  in  general.  The  LSER  parameters  were  developed 
in  solution,  and  take  into  account  those  solute  factors  the  TLSER  parameters 
cannot.  At  present,  the  exact  magnitude  these  errors  represent  is  not  evident. 
The  second  factor  relates  to  the  use  of  empirical  correction  terms.  Kamlet 
and  Taft  recognized  that  certain  classes  of  compounds  were  routinely  found 
as  outliners  in  their  LSER  relationships.  As  such,  they  developed  an  elabo¬ 
rate  set  of  correction  terms  to  bring  those  faulty  values  into  line.  A 
correlation  without  these  correction  factors  would  yield  a  result  much  worse 
than  the  TLSERs.  On  the  other  hand,  the  TLSER  has  not  had  any  corrections 
applied  to  any  of  the  descriptors  and,  nevertheless,  results  in  very  high 
correlations. 


With  respect  to  the  descriptors,  there  are  several  additional  trends 
that  are  evident.  First,  the  molecular  volume  and  molar  volume  contain  the 
same  Information.  This  can  be  seen  because  the  coefficients  are  the  same. 
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These  are  the  most  important  factors  in  describing  the  realationship.  This 
can  be  seen  from  Table  5,  which  lists  the  t-scores  for  each  of  the  variables 
in  Equation  12.  In  identification,  the  sign  of  the  coefficient  Is  important 
because  larger  compounds  prefer  the  octanol  layer,  giving  rise  to  a  larger 
KqW  result. 


Table  5.  t-Scores  for  Equation  12 


Parameter 


t-Score 


Vmc 
*1 
e  b 
fi¬ 


ll. 73 
-0.37 
2.76 
-11.63 


The  second  descriptor  in  each  equation,  w*and  *1 ,  also  contains 
obvious  similarities.  Although  the  two  parameters  do  not  correlate  well  with 
each  other,  as  was  previously  noted,  they  do  have  approximately  the  same  coef¬ 
ficient  with  some  sign.  In  both  equations,  this  coefficient  is  the  smaller, 
indicating  this  is  the  least  important  vaiable  in  describing  Ko#.  One  can 
see  from  Table  2  that  the  t»score  indicates  that  u,1s  not  significant  at 
the  95$  level  (t  gc=1.960).  The  it,  term  has  been  retained  for  comparison 
with  the  LSER  equation.  The  signs  of  the  polarizability  terms  suggest  that, 
because  it  is  easier  to  polarize  the  electron  cloud  and  induce  a  dipole,  the 
solubility  in  water  would  increase. 

The  final  descriptors,  those  dealing  with  the  basicity,  also  yield 
a  fair  amount  of  information.  First,  the  signs  agree  with  each  other  and 
with  chemical  intuition.  The  degree  of  basicity  should  be  proportional  to 
the  solubility  in  water  and,  therefore,  inversely  proportional  to  the  KQW. 

The  b  term  does  have  a  negative  coefficient,  indicating  that  an  increase  in 
basicity  lowers  the  LOG  K^.  The  TLSER  parameters,  and  q_  produce  a 
similar  trend.  As  previously  described,  eh  increases  with  a  decrease  in 
basicity.  Therefore,  the  coefficient  should  be  positive,  indicating  that  a 
decrease  in  covalent  basicity  increases  the  solubility  in  octanol.  The 
electrostatic  term,  q,  is  directly  proportional  to  the  basicity.  Therefore, 
the  coefficient  should  be  negative  as  shown  in  equation  12.  Equation  13 
indicates  b  to  be  the  single  most  important  descriptor  in  the  LSER.  Table  2 
shows  that  one  of  the  basicity  terms,  q_,  is  as  significant  as  the  molecular 
vol ume. 


3.2.2  Konneman's  Fish  Toxicity. 


A  second  type  of  solute/solvent  interaction,  one  completely  differ¬ 
ent  from  Kow,  is  the  application  of  the  TLSER  to  toxicity  and  biological 
activity.  The  incidence  of  toxicity  can  be  preceived  in  one  of  two  mechanisms. 
One  is  by  a  specific  receptor  lock-and-key  approach.  The  second  is  by  non¬ 
specific  general  toxicity  involving  multiple-potential  mechanisms.  This 
section  and  the  next  one  give  two  examples  of  the  applicability  of  the  TLSER 
to  nonspecific  toxicity. 
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Konneman  measured  the  LCgg  for  a  series  of  aliphatic  and  aromatic 
industrial  pollutants  on  guppies  [Poecilia  reticulata).  Due  to  the 
diverse  nature  of  the  compounds  measured,  a  single  specific  mechanism  would 
not  suffice.  Equation  14  shows  the  USER  for  the  LC^q  of  Konneman. 


LOG 


vmc 

LC50  =  -0.928  —  -  10. 

LlOO. 


557  (irI*10)  -  1 
N  =  32  R  =  0.9434  SEE  =  0.5736 


'  eb" 

.442  —  -  0. 

.  10 - 


443q i  +  18.082 

(14) 


LOG  LC50  = 


0.003**  +  3.2778  +  5.397 


N  =  24  R  =  0.9778  SEE  -  0.3152 


(15) 


As  with  the  KqW,  the  LSER  yields  a  significantly  (based  upon  the 
increased  R  and  the  decreased  SEE)  better  fit.  However,  the  coefficients  in 
this  case  are  quite  different.  The  only  possible  explanation  is  that  the 
additional  compounds  used  for  the  TLSER  were  significantly  different  from  the 
other  compounds. 


3.2.3  Microtox  Toxicity  Test. 

A  second  biologically  based  interaction  correlated  for  the  effec¬ 
tiveness  of  the  TLSER  descriptors  is  the  Microtox  test.26  The  Microtox 
test  is  being  used  as  a  cost-effective  prescreening  technique  for  locating 
organic  chemicals  toxic  to  fish.  In  this  test,  the  inhibition  of  biolumin¬ 
escence  in  Photobacterium  phoephoreum  is  used  to  determine  the  EC^g  (50% 
education  in  light  production  after  5  min).  Correlations  of  toxicant  EC5g 
to  Fhotobacterium  phcaphoreum  and  toxicant  EC50  to  the  fathead  minnow  have 
shown  fair  to  good  results. 


Equations  16  and  17  show  the  correlations  derived  from  using  the 
TLSER  and  the  LSER  parameters,  respectively.27*  The  corresponding  descrip¬ 
tors,  in  general,  correlate  well  with  each  other  in  terms  of  sign  and  absolute 
magnitude.  In  this  case,  the  correlations  of  the  two  parameter  sets  yield 
almost  the  same  degree  of  fit  based  upon  R  and  SEE. 


LOG 


vmc 

EC50  =  -4.067  —  -  4. 

100. 


147  (*1*10)  +  3. 902q_  -  2.777qH+  +  11.656 


N  =  25  R  =  0.9823  SEE  =  0.3323 


(16) 


*Faminl,  G.R.,  Using  Theoretical  Descriptors  in  Structure  Activity  Relation¬ 
ships  VI.  A  Comparison  of  QSAR  Methodologies,  U.S.  Army  Chemical  Research, 
Development  and  Engineering  Center,  Aberdeen  Proving  Ground,  M0,  unpublished 
data  July  1989,  UNCLASSIFIED  Report. 
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LOG  EC 


50 


-4.36 


0.979/  +  4.112$  -  1 . 798a  +  7.531 


N  *  25  R  -  0.9889  SEE  =  0.2634 


(17) 


3.2.4  UV-yisible  (UV-Vls)  Absorption  in  Ylides. 

The  examples  shown  to  this  point  have  been  solute  effects.  That 
Is,  the  solvent  environment  has  not  changed,  be  it  octanol/water,  a  micro¬ 
organism,  or  a  fish,  and  the  regressions  explain  the  phenomena  of  a  solute 
going  Into  or  through  these  systems.  Another  application  of  the  LSER  and  the 
TLSER  is  the  analogous  relationship,  understanding  solvent  effects.  The 
solute  remains  unchanged,  and  the  solvent  system  is  altered,  demonstrating 
the  analogous  relationship. 

The  LSER  parameters,  also  called  the  sol vatochromic  parameters, 
were  originally  developed  from  UV-Vis  spectral  shift  of  selected  indicators. 

As  such.  It  would  be  expected  that  the  LSER  would  yield  excellent  results. 
Molecular  orbital  (MO)  calculations  can  predict  neat,  spectral  abosrptions 
with  varying  degress  of  accuracy.  However,  MO  calculations  have  not  been 
capable  of  idenfitying  and  quantifying  solvent  effects  to  spectral  shifts  In 
most  cases. 

To  test  the  applicability  of  the  TLSER  parameters  to  solvent  effects 
in  general  and  UV-Vis  absorption  in  particular,  a  data  set  of  Dorohoi  was 
used.2®  The  solvent  effect  on  the  spectrum  of  the  pyridinium  ylide  (Figure  4) 
was  measured  for  23  solvents. 
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Figure  4.  Pyridinium  Ylide 


The  regressuion  using  the  significant  TLSER  parameters  is  shown  in 
equation  18.  Equation  19  shows  the  analogous  relationship  for  the  LSER 
parameters/ 

^FamTftTT  g.k.  ,  using  Theoretical  Descriptors  in  Structure  Activity  Relation¬ 
ships  VI.  A  Comparison  of  QSAR  Methodologies,  U.S.  Army  Chemical  Research, 
Development  and  Engineering  Center,  Aberdeen  Proving  Ground,  MD,  unpublished 
data  July  1989,  UNCLASSIFIED  Report. 
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EA  =  20545  -  2444.3  |>i*10]  +  3540 


ea 

10 


+  1430q_  -  383qH+ 


N  =  23  R  =  0.9508  SEE  =  307.5 


(18) 


EA  =  20617  +  239-rr*  +  1357$  +  1412a 
N  =  23  R  =  0.9529  SEE  =  295.1 


(19) 


These  regressions  show  that  although  the  LSER  (equation  19)  corre¬ 
lates  better,  the  increase  is  relatively  minor.  Unlike  the  previous  examples, 
several  of  the  comparison  are  not  valid.  The  basicity  values,  B  and  q_,  have 
approximately  the  same  value  and  same  sign.  Based  upon  a  positive  e  a  and  a 
negative  qH+,  the  TLSER  acidity  term  would  indicate  a  negative  correlation 
with  increasing  acidity.  The  LSER,  however,  has  a  positive  acidity  term,  a  . 
In  addition,  although  the  TLSER  and  LSER  have  roughly  the  same  magnitude  for 
the  polarizability  terms,  the  signs  are  reversed. 

3.2.5  High  Performance  Liquid  Chromatography  (HPLC)  Retention  Times 

(Hafkensheid  Capacity  Factors). 

Another  solute/solvent  interaction  that  can  be  correlated  with  the 
LSER  and  TLSER  parameters  is  chromatography.  In  either  liquid  or  gas  chroma¬ 
tography  (GC),  the  solute  establishes  an  equilibrium  between  the  eluent  and 
the  solid  phase.  The  retention  times  and  capacity  factors  are  dependent  upon 
the  established  equibrium.  As  shown  earlier,  the  TLSER  and  LSER  parameters 
describe  equilibrium  conditions  well,  and  therefore  should  correlate  with  many 
HPLC  and  GC  Indices. 


An  example  to  demonstrate  this  usefulness  is  the  reversed-phase, 
HPLC  retention  index  of  Hafkensheid  and  Tomlinson.29  The  capacity  factor  of 
a  75%  methanol -water  mixture  ^75)  was  determined  for  19  different  solutes. 
Equation  20  shows  the  TLSER  for  this  data  set,  and  equation  21  shows  the  LSER 
for  the  set.29 


Vmc 

L- 185  —  -  0.610  Ut*10)  -  2.290q_  -  1.077qH  + 

.  100  J 


LOG  k75  =  -0.083  +  1. 

N  =  19  R  =  0.9859  SEE  =  0.0729 
LOG  kyg  =  -0.552  +  0. 952 J  —  -  1.456b  -  0.209a 

L  100  J 

N  =  21  R  =  0.9938  SEE  =  0.0492 


(20) 


(21) 


As  can  be  seen,  the  LSER  gives  a  better  correlation  for  this  data  set.  The 
coefficients  for  the  two  sets  of  parameters  result  in  the  same  sign  and 
approximately  the  same  values.  In  both  equations,  the  basic  term  is  the 
single  most  important  descriptor. 
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3.2.6  Hydrolysis  of  Qrganophosphorus  Compounds. 

A  major  solute/solvent  interaction  of  interest  to  CRDEC  is  the 
rate  constant  for  the  hydrolysis  of  chemical  nerve  toxins.  Hydrolysis  of  the 
toxin  is  important  from  two  aspects.  First,  the  ability  of  the  decontamina¬ 
tion  materials  applied  to  the  toxin  to  neutralize  it,  and  second,  persistence 
of  the  toxin  in  the  environment  due  to  resistance  to  natural  hydrolysis.  As 
such,  it  is  important  to  be  able  to  predict  the  hydrolysis  of  potential 
toxins  to  assess  their  natural  persistence.  The  mechanism  for  this  reaction 
is  given  in  Figure  5.30*31 


OR 


Toxic 


+  OH 


V 


0 

JL 


OR 
I 

OH 

non~Toxic 


+  F 


Figure  5.  Mechanism  for  the  Base  Hydrolysis  of  G-Agents 


Equation  22  shows  the  TLSER  regression  foa  a  series  of  10  organo- 
phosphorus  compounds  of  the  G-Agent  type.  No  regression  was  possible  for 
the  Kamlet-Taft  parameters  due  to  the  above  empiricity  of  the  sol vatochromic 
parameters. 

rvmci 

k0H  =  -2814.4  -  97.7  —  U  874.1  (n^lO)  +  1256 

LiooJ 


(22) 


N  =  10  R  =  0.9859  SEE  =  26.9 


Several  phenomena  can  be  rationalized  from  equation  22.  First,  in 
examining  the  relative  importance  of  each  variable,  it  can  be  seen  from 
Table  6  that  the  MO  basicity  is  the  most  important  parameter  followed  by  Vm 
and  it.  At  first  glance,  the  mechanism  would  seem  to  indicate  the  solute, 
organophosphorus  compound  (OP),  would  be  operating  as  the  acid,  and  therefore 
ea  should  be  the  dominant  term.  This  can  be  rationalized  by  remembering 
that,  in  general,  as  the  E^q^o  decreases  (i.e.,  cb  increases),  the  E luMO 
decreases  (i.e.,  the  ea  decreases).  There  is  a  direct  but  inverse  rela¬ 
tionship  between  eb  and  ea»  as  one  Increases,  the  other  decreases. 

Table  6.  t-Scores  for  Equation  22 


Parameter 


t-Score 


VMC 
11 1 
c  b 


2.72 

2.45 

11.66 
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The  signs  and  magnitude  of  the  Vm  and  can  also  be  explained. 

As  the  OR  group  gets  larger,  it  blocks  the  phosphorus  from  Sn2  attack  by  the 
OH".  This  will  have  a  negative  effect  upon  the  rate  contant,  because  in 
this  data  set,  the  principal  variant  is  the  ester  organic  group  (OR). 
Therefore,  Vmc  will  be  directly  proportional  to  the  size  of  the  OR  group. 

In  the  TLSER,  then,  Vmc  would  be  expected  to  be  negative.  On  the  other  hand, 
the  ir,  has  a  positive  correlation  with  the  rate  constant.  The  OP  will  feel 
the  effecive  field  due  to  the  charge  on  the  OH,  and  a  dipole  will  be  induced 
as  the  OH  enters  the  reaction  sphere  of  the  OP.  This  Induced  dipole  aids  the 
hydrolysis  and  permits  the  base  to  react  in  a  move  facile  manner. 


4.  CONCLUSION 

This  report  shows  the  general  usefulness  of  the  TLSER  descriptors 
and  the  applicability  to  the  correlation  and  prediction  of  solute/solvent 
Interactions.  This  was  a  review  of  the  previous  reports  in  this  series  in 
which  each  descriptor  has  been  presented  individually.  Figure  6  presents  a 
summary  of  the  types  of  solute/solvent  interactions  currently  correlated  with 
the  TLSER  parameters. 


UV*Vls 

Absorptions 


Absorptions 


Figure  6.  Correlations  with  the  TLSER 


In  each  of  the  correlations  reported,  the  TLSER  parameters  yield 
a  slightly  lower  correlation  than  the  corresponding  LSER  parameters.  However, 
it  must  be  emphasized  that  the  Kamlet-Taft  LSER  parameter  uses  several 
correction  factors  that  increase  the  correlation  coefficient  and  hence,  the 
fit.  The  TLSER  parameters  use  no  such  correction  factors,  and  all  values  for 
the  parameters  are  as  they  would  appear  in  the  calculations.  The  major 
drawback  of  the  TLSER  parameters  is  that  intermolecular  interactions  are  not 
taken  into  account.  All  calculations  are  done  assuming  a  single  molecule  in 
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vacuum.  This  precludes  the  natural  Incorporation  of  crystal  and  proximity 
effects  that  are  present  in  liquids  and  solids  and  in  solution.  The  degree 
of  fit  of  the  USER  should  also  increase  as  the  level  of  the  theoretical 
methods  increase.  Due  to  computational  limitations,  high  level  initial 
calculations  are  not  feasible  on  compounds  of  this  size  at  this  time.  However, 
as  computational  resources  become  more  powerful.  Initial  electronic  results 
should  further  refine  the  TLSER. 

The  principal  use  of  the  TLSER  is  in  the  a  priori  prediction  of 
solute/solvent  interactions.  Using  any  of  the  correlations  in  this  study 
for  new  compounds  requires  only  that  the  MNDO  calculation  be  performed  and 
the  necessary  volume  calculation  be  done.  Unlike  the  USER  parameters, 
synthesis  is  not  required.  As  a  result,  the  TLSER  parameters  for  a  given 
compound  can  usually  be  calculated. 
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